Visual Emotion Recognition Using ResNet
نویسندگان
چکیده
منابع مشابه
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
The trend towards increasingly deep neural networks has been driven by a general observation that increasing depth increases the performance of a network. Recently, however, evidence has been amassing that simply increasing depth may not be the best way to increase performance, particularly given other limitations. Investigations into deep residual networks have also suggested that they may not...
متن کاملAudio-Visual Spontaneous Emotion Recognition
Automatic multimodal recognition of spontaneous emotional expressions is a largely unexplored and challenging problem. In this paper, we explore audio-visual emotion recognition in a realistic human conversation setting—the Adult Attachment Interview (AAI). Based on the assumption that facial expression and vocal expression are at the same coarse affective states, positive and negative emotion ...
متن کاملSpeaker-dependent audio-visual emotion recognition
This paper explores the recognition of expressed emotion from speech and facial gestures for the speaker-dependent case. Experiments were performed on an English audio-visual emotional database consisting of 480 utterances from 4 English male actors in 7 emotions. A total of 106 audio and 240 visual features were extracted and features were selected with Plus l-Take Away r algorithm based on Bh...
متن کاملPhysio-visual data fusion for emotion recognition
Several approaches have been proposed to recognize human emotions based on facial expressions or physiological signals, relatively rare work as been done to fuse these two, and other, modalities to improve the accuracy and robustness of the emotion recognition system. In this paper, we ropose two methods based on feature-level and decision-level to fuse facial and physiological modalities. At f...
متن کاملNoise Analysis in Audio-Visual Emotion Recognition
This paper describes the use of a decision-based fusion framework to infer emotion from audiovisual feeds, and investigates the effect of noise on the fusion system. Facial expression features are constructed from linear binary patterns, and are processed independently of the prosodic features. A linear support vector machine is used for the fusion of the two channels. The results show that the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceeding of the Electrical Engineering Computer Science and Informatics
سال: 2018
ISSN: 2407-439X,2407-439X
DOI: 10.11591/eecsi.v5i5.1700